Data Science with Pandas, ScikitLearn and Altair 0: Introduction

Introduction In each of this series of posts I will tackle some problem that involves data processing and analysis. I'm going to ivestigate the data, visualise it and possibly perform some predictive analysis with machine learning. The aim is to demonstrate the skills and tools involved at the same time as doing something useful or interesting.

The Tools I'll be using Google Colab as my IDE and running python3 as my programming language and making use of the following libraries * numpy for dealing with vectors and arrays. * pandas for manimulating data. * altair for visualisation. * scikit learn for training machine learning models. * scipy to provide numerical analysis algorithms

Most notebooks will start with the following code to import the tools. The last line allows Google colab to open files from the local envirnment.

import io, math
import numpy as np
import altair as alt
import pandas as pd
import sklearn as skl
import scipy as scp
from google.colab import files

The first few posts will focus on problems with an educational theme as that is my background.

Comments